Abstract

Let $V(n)$ be the minimum number of monochromatic 3-term arithmetic progressions
in any 2-coloring of $\{1, 2, \ldots, n\}$. We show that

$$\frac{1675}{32768}\, n^2 (1 + o(1)) \le V(n) \le \frac{117}{2192}\, n^2 (1 + o(1)).$$

As a consequence, we find that $V(n)$ is strictly greater than the corresponding number
for Schur triples (which is $\frac{1}{22} n^2 (1 + o(1))$). Additionally, we disprove the conjecture
that $V(n) = \frac{1}{16} n^2 (1 + o(1))$ as well as a more general conjecture.



1. Introduction 



At the Erdős Conference in Budapest in the summer of 1999, Ron Graham proposed the
following \$100 problem:



Let $V(n)$ be the minimum number of monochromatic 3-term arithmetic progressions
in any 2-coloring of $[1,n] = \{1, 2, \ldots, n\}$. Given $V(n) = \beta n^2 (1 + o(1))$,
determine $\beta$.
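
For very small $n$ the quantity $V(n)$ can be computed by exhaustive search, which makes the definition concrete. The following is a minimal sketch, not part of the original paper; the function names are hypothetical, and the search is exponential in $n$, so it is only practical for roughly $n \le 20$.

```python
from itertools import product


def count_mono_3aps(coloring):
    """Count monochromatic 3-term APs a < a+d < a+2d in a 0/1 sequence.

    coloring[k] is the color of the integer k+1, so len(coloring) == n.
    """
    n = len(coloring)
    count = 0
    for d in range(1, (n - 1) // 2 + 1):
        # Compare each element with the ones d and 2d positions later.
        for x, y, z in zip(coloring, coloring[d:], coloring[2 * d:]):
            if x == y == z:
                count += 1
    return count


def brute_force_V(n):
    """Minimum number of monochromatic 3-APs over all 2-colorings of [1, n]."""
    # Swapping the two colors preserves the count, so fix the color of 1.
    return min(
        count_mono_3aps((0,) + rest)
        for rest in product((0, 1), repeat=n - 1)
    )
```

Since the van der Waerden number for two colors and 3-term progressions is 9, `brute_force_V(8)` is 0 while `brute_force_V(9)` is positive.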
This problem seems to be much more abstruse than the corresponding problem concerning
Schur triples (see [D], [RZ], [S]). It is conjectured, and commonly believed, that $\beta = \frac{1}{16}$, in
part because of the following "folklore" conjecture.

Conjecture. The minimum number of monochromatic solutions, in any r-coloring of $[1,n]$,
of $\sum_{i=1}^{m} c_i x_i = 0$ with $\sum_{i=1}^{m} c_i = 0$ is equal to the value achieved by randomly coloring the
integers in $[1, n]$.

In the case of 3-term arithmetic progressions, the equation is $x + y = 2z$ and the value
achieved by randomly 2-coloring the integers in $[1, n]$ is $\frac{n^2}{16}(1 + o(1))$, since there are $\frac{n^2}{4}(1 + o(1))$
3-term arithmetic progressions in $[1,n]$, of which the expected fraction that are
monochromatic under a random 2-coloring is $\frac{1}{4}$.

The conjecture states that $V(n) = \frac{n^2}{16}(1 + o(1))$. We show that this conjecture is false
by proving that $V(n) \le \frac{117}{2192} n^2 (1 + o(1))$. While we do not find $\beta$, we are able to offer fairly
good upper and lower bounds. We do believe that the upper bound is extremely close, if not
equal, to $V(n)$.

2. Preliminaries for the Lower Bound 

Let $\chi : [1,n] \to \{0, 1\}$ be an arbitrary 2-coloring. Define, for $j = 0, 1$,

$$S_j = \{x : \chi(x) = j,\ 1 \le x \le n\}.$$

Let $V(S_0, S_1) = V(n; S_0, S_1)$ be the number of monochromatic 3-term arithmetic progressions
in $[1, n]$ under $\chi$.

Using an approach found in [S] and [D], we let

$$f_j(x) = \sum_{s \in S_j} e^{2\pi i s x},$$

which gives us

$$2V(S_0, S_1) = \int_0^1 \left( f_0^2(x) f_0(-2x) + f_1^2(x) f_1(-2x) \right) dx.$$

We rewrite the integrand as

$$\left(f_0(x) + f_1(x)\right)^2 \left(f_0(-2x) + f_1(-2x)\right) - \left(f_0(x) f_1(-2x) + f_1(x) f_0(-2x)\right)\left(f_0(x) + f_1(x)\right) - f_0(x) f_1(x) \left(f_0(-2x) + f_1(-2x)\right)$$

and interpret the integral as

$$2V(S_0, S_1) = |\{(a, b, c) \in [1, n]^3 : a + b = 2c\}| - |\{(a, b) \in (S_0 \times S_1) \cup (S_1 \times S_0) : 2b - a \in [1, n]\}| - |\{(a, b) \in S_0 \times S_1 : a + b \text{ is even}\}|.$$

We will now bound the size of these sets, where our equations are valid up to $o(n^2)$.
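
The counting identity above can be checked numerically on a random coloring. The sketch below is an illustration only and is not taken from the paper; the function name is hypothetical. It relies on one assumption worth stating: counting exactly rather than up to $o(n^2)$, the $n$ degenerate triples $a = b = c$ are all monochromatic, so the left-hand side becomes $2V + n$.

```python
import random


def identity_terms(n, seed=0):
    """Return (V, triples, N_plus, T) for a random 2-coloring of [1, n]."""
    rng = random.Random(seed)
    chi = [None] + [rng.randint(0, 1) for _ in range(n)]  # 1-based colors
    pts = range(1, n + 1)

    # V: monochromatic progressions a < a+d < a+2d.
    V = sum(
        1
        for d in range(1, (n - 1) // 2 + 1)
        for a in range(1, n - 2 * d + 1)
        if chi[a] == chi[a + d] == chi[a + 2 * d]
    )
    # Ordered solutions of a + b = 2c in [1, n]^3 (c is determined by a, b).
    triples = sum(1 for a in pts for b in pts if (a + b) % 2 == 0)
    # N^+: dichromatic ordered pairs (a, b) with 2b - a in [1, n].
    N_plus = sum(
        1 for a in pts for b in pts
        if chi[a] != chi[b] and 1 <= 2 * b - a <= n
    )
    # T: pairs (a, b) in S_0 x S_1 with a + b even.
    T = sum(
        1 for a in pts for b in pts
        if chi[a] == 0 and chi[b] == 1 and (a + b) % 2 == 0
    )
    return V, triples, N_plus, T
```

For any coloring, `triples - N_plus - T` should equal `2 * V + n`, because every non-monochromatic solution contains exactly two dichromatic pairs among its three pairs.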

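It is trivial to show that $|\{(a, b, c) \in [1,n]^3 : a + b = 2c\}| = \frac{n^2}{2}(1 + o(1))$. It is also easy
to show that $|\{(a, b) \in S_0 \times S_1 : a + b \text{ is even}\}| \le \frac{n^2}{8}(1 + o(1))$ as follows. Denote this set by
$T$ and let $r_o$ and $b_o$ be the number of odd numbers in $[1, n]$ of color red (in $S_0$, say) and blue
(in $S_1$), respectively, and let $r_e$ and $b_e$ be the number of even numbers in $[1, n]$ of color red and
blue, respectively. Then, up to $o(n^2)$,

$$|T| = r_o b_o + r_e b_e$$

$$= \tfrac{1}{2}\left((r_o + b_o)^2 + (r_e + b_e)^2 - (r_o^2 + b_o^2 + r_e^2 + b_e^2)\right)$$

$$= \tfrac{1}{2}\left(\left(\tfrac{n}{2}\right)^2 + \left(\tfrac{n}{2}\right)^2 - (r_o^2 + b_o^2 + r_e^2 + b_e^2)\right)$$

$$= \tfrac{1}{2}\left(\tfrac{n^2}{2} - (r_o^2 + b_o^2 + r_e^2 + b_e^2)\right)$$

$$= \tfrac{n^2}{4} - \tfrac{1}{2}\left(r_o^2 + \left(\tfrac{n}{2} - r_o\right)^2 + r_e^2 + \left(\tfrac{n}{2} - r_e\right)^2\right)$$

$$= \tfrac{n}{2}(r_o + r_e) - (r_o^2 + r_e^2).$$

This function attains its maximum of $\frac{n^2}{8}(1 + o(1))$ when $r_o = r_e = \frac{n}{4}$.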

Next, we define

$$N^+ = \{(a, b) \in (S_0 \times S_1) \cup (S_1 \times S_0) : 2b - a \in [1, n]\}.$$

Our goal is to find an upper bound for $|N^+|$ and use the following lemma, which follows
immediately from the paragraphs above.

Lemma 1 If $|N^+| \le c n^2 (1 + o(1))$, then

$$V(S_0, S_1) \ge \frac{1}{2}\left(\frac{3}{8} - c\right) n^2 (1 + o(1)).$$

3. Lower Bound Calculations 

Our approach will be to consider points in the square $[1,n]^2$. From the definition of $N^+$,
we restrict our attention to those points $(x, y)$ with $0 \le 2y - x \le n$. We also remark that
since we are looking for the coefficient of the $n^2$ term in $V(n)$, we will disregard points that
contribute $o(n^2)$ to $V(n)$.

[Figure 1: Partition of the square into rectangles, for L = 16. The lines $2y - x = 0$ and $2y - x = n$ are marked.]

Consider the diagram in Figure 1. We are trying to find the maximum number of dichromatic
pairs $(a, b)$ that can reside inside the parallelogram bounded by the lines $x = 0$, $x = n$,
$2y - x = 0$, and $2y - x = n$. To this end, we cover the parallelogram by $L$ horizontal strips
of height $\frac{n}{L}$ and right triangles with dimensions $\frac{n}{L} \times \frac{n}{2L}$ (in Figure 1 we have $L = 16$). As
such, we cover more than the parallelogram (we have right triangles outside of the
parallelogram). Hence, by maximizing the number of dichromatic pairs inside the strips and the
right triangles, we have an upper bound on the maximum number of dichromatic pairs that
can reside inside the parallelogram.
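Let $\left(\frac{(i-1)n}{L}, \frac{in}{L}\right]$ contain $r_i$ red elements, $i = 1, 2, \ldots, L$. Choosing $L$ to be even, we can
easily write down a formula for the number of dichromatic pairs that reside in the horizontal
strips:

$$\sum_{i=1}^{L/2} \sum_{j=1}^{2i-1} \left( r_i \left(\tfrac{n}{L} - r_j\right) + \left(\tfrac{n}{L} - r_i\right) r_j \right) + \sum_{i=L/2+1}^{L} \sum_{j=2i-L}^{L} \left( r_i \left(\tfrac{n}{L} - r_j\right) + \left(\tfrac{n}{L} - r_i\right) r_j \right). \qquad (1)$$

What remains is the maximum possible number of dichromatic points in the $L$ remaining
triangles. For these we use the trivial bound of their areas, $L \times \frac{1}{2} \cdot \frac{n}{L} \cdot \frac{n}{2L} = \frac{n^2}{4L}$. Combining this
with (1), we have an upper bound on $|N^+|$:

$$|N^+| \le \frac{n^2}{4L} + \sum_{i=1}^{L/2} \sum_{j=1}^{2i-1} \left( r_i \left(\tfrac{n}{L} - r_j\right) + \left(\tfrac{n}{L} - r_i\right) r_j \right) + \sum_{i=L/2+1}^{L} \sum_{j=2i-L}^{L} \left( r_i \left(\tfrac{n}{L} - r_j\right) + \left(\tfrac{n}{L} - r_i\right) r_j \right). \qquad (2)$$
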

We next present two different techniques to effectively bound the right-hand side of (2).
The first one relies on an explicit enumeration of all the critical points (for $L = 16$), while
the second approach uses a procedure based on semidefinite programming.

3.1 Enumeration Bounds for L = 16 

In this approach, all critical points in $(0, \frac{n}{16})^{16}$ are compared against all maximum values
at the $3^{16} - 1$ boundary problems. The maximization problem has been programmed into
Maple as a small program called PABLO and the code is available from the second author's
website$^1$.

After running for approximately 136 hours on a 2.7GHz G5 Macintosh server, we find 
that 

|iV+| - 2^8 (1 + 0(1)) ' 
One coloring that achieves this bound is 
Applying Lemma 1, the above result gives us the following theorem.
Theorem 2 V{n) > j^n 2 (l + o(l)). 



3.2 Semidefinite Bounds 

A different, more powerful way of bounding $|N^+|$ is based on semidefinite relaxations. For
this, consider first the change of variables $x_i := \frac{2L}{n} r_i - 1$, so $r_i \in [0, \frac{n}{L}]$ if and only if $x_i \in [-1, 1]$.
Then, equation (2) can be written as

$$|N^+| \le \frac{n^2}{4L} + \sum_{i=1}^{L/2} \sum_{j=1}^{2i-1} \frac{n^2}{2L^2} (1 - x_i x_j) + \sum_{i=L/2+1}^{L} \sum_{j=2i-L}^{L} \frac{n^2}{2L^2} (1 - x_i x_j) = \frac{n^2}{4L} + \frac{n^2}{4} - \frac{n^2}{4L^2}\, q(x),$$

where

$$q(x) := \sum_{i=1}^{L/2} \sum_{j=1}^{2i-1} 2 x_i x_j + \sum_{i=L/2+1}^{L} \sum_{j=2i-L}^{L} 2 x_i x_j.$$

Our objective is to bound $|N^+|$ from above. For this, it is clearly enough to obtain a lower
bound of the quadratic form $q(x)$ over $[-1,1]^L$. This quadratic form can be represented as
$q(x) = x^T A x$, where $A$ is an $L \times L$ symmetric integer matrix, with entries $A_{ij} = B_{ij} + B_{ji}$
and

$$B_{ij} = \begin{cases} 1 & \text{if } j + 1 \le 2i \le j + L, \\ 0 & \text{otherwise.} \end{cases}$$



A useful bound for quadratic forms on the unit hypercube, used extensively in the
combinatorial optimization literature, can be obtained as follows.

Lemma 3 Let $A$ be an $n \times n$ matrix and let $D = \mathrm{diag}(d_1, \ldots, d_n)$ be a diagonal matrix,
such that $A + D$ is positive semidefinite. Then, for all $x \in [-1,1]^n$, $x^T A x$ is bounded below
by $-\sum_{i=1}^{n} d_i$.

Proof. Consider any vector $x \in [-1,1]^n$. Since $A + D$ is positive semidefinite it follows that

$$0 \le x^T (A + D) x = x^T A x + \sum_{i=1}^{n} d_i x_i^2.$$

Since $x_i^2 \le 1$ (and taking each $d_i \ge 0$), we have $x^T A x \ge -\sum_{i=1}^{n} d_i x_i^2 \ge -\sum_{i=1}^{n} d_i$. □
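
To see Lemma 3 in action without an SDP solver, one can pick a crude feasible $D$ by diagonal dominance. The sketch below is an illustration under stated assumptions, not the method used in the paper: the function names are hypothetical, and the choice $d_i = \max(0, \sum_{j \ne i} |A_{ij}| - A_{ii})$ is a substitute for the semidefinite program. It makes $A + D$ symmetric and diagonally dominant with nonnegative diagonal, hence positive semidefinite, but it yields a much weaker bound than an optimized $D$.

```python
from itertools import product


def build_A(L):
    """Matrix A with A_ij = B_ij + B_ji, B_ij = 1 iff j+1 <= 2i <= j+L."""
    B = [
        [1 if j + 1 <= 2 * i <= j + L else 0 for j in range(1, L + 1)]
        for i in range(1, L + 1)
    ]
    return [[B[i][j] + B[j][i] for j in range(L)] for i in range(L)]


def diagonal_dominance_shift(A):
    """A nonnegative d making A + diag(d) diagonally dominant (so PSD)."""
    n = len(A)
    return [
        max(0, sum(abs(A[i][j]) for j in range(n) if j != i) - A[i][i])
        for i in range(n)
    ]


def quad_form(A, x):
    """Evaluate x^T A x."""
    n = len(A)
    return sum(A[i][j] * x[i] * x[j] for i in range(n) for j in range(n))


def min_over_vertices(A):
    """Brute-force minimum of x^T A x over the vertices {-1, 1}^n."""
    n = len(A)
    return min(quad_form(A, x) for x in product((-1, 1), repeat=n))
```

For a small even `L` such as 8, `min_over_vertices(build_A(L))` is never below `-sum(diagonal_dominance_shift(build_A(L)))`, as the lemma predicts; the semidefinite program in the text searches for the $d_i$ that make this bound as tight as possible.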



For any finite value of $L$, a suitable set of $d_i$ can be found by semidefinite programming.
For the case $L = 128$ we have found a particular solution (given in the Appendix) using the
SDP solver SeDuMi, followed by a straightforward rounding procedure (to obtain rational
solutions). For such a solution, it can be easily verified on a computer that the $128 \times 128$
rational matrix $A + D$ is indeed positive definite. Since we have $\sum_{i=1}^{L} d_i = 1364$, this gives
an upper bound for $|N^+|$ with $c = \frac{1}{4 \cdot 128} + \frac{1}{4} + \frac{1364}{4 \cdot 128^2} = \frac{4469}{16384}$, resulting in the lower bound (via
Lemma 1) given in the next theorem.

Theorem 4 $V(n) \ge \frac{1675}{32768} n^2 (1 + o(1))$.
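
The arithmetic leading from $\sum d_i = 1364$ to the constant in Theorem 4 can be reproduced with exact rational arithmetic. This is a small sketch added for verification, not code from the paper; the function name is hypothetical, and the formula for $c$ is the one obtained by substituting the bound $q(x) \ge -\sum d_i$ into the expression for $|N^+|$ in Section 3.2.

```python
from fractions import Fraction


def lower_bound_constant(L, sum_d):
    """Return (c, beta_lower) with |N^+| <= c n^2 and V(n) >= beta_lower n^2."""
    # c = 1/(4L) + 1/4 + (sum of d_i) / (4 L^2)
    c = Fraction(1, 4 * L) + Fraction(1, 4) + Fraction(sum_d, 4 * L * L)
    # Lemma 1: V >= (1/2)(3/8 - c) n^2 (1 + o(1))
    beta_lower = Fraction(1, 2) * (Fraction(3, 8) - c)
    return c, beta_lower


c, beta_lower = lower_bound_constant(128, 1364)
```

With these inputs, `c` equals `Fraction(4469, 16384)` and `beta_lower` equals `Fraction(1675, 32768)`, which lies strictly between the Schur-triple constant 1/22 and the upper bound 117/2192.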



4. The Upper Bound 



Theorem 5 $V(n) \le \frac{117}{2192} n^2 (1 + o(1))$.

Proof. Let $z^m = zz \cdots z$ ($m$ times), i.e., a string of $z$'s of length $m$. Consider the coloring, using the
colors 0 and 1,

$$0^{\frac{28}{548}n}\, 1^{\frac{6}{548}n}\, 0^{\frac{28}{548}n}\, 1^{\frac{37}{548}n}\, 0^{\frac{59}{548}n}\, 1^{\frac{116}{548}n}\, 0^{\frac{116}{548}n}\, 1^{\frac{59}{548}n}\, 0^{\frac{37}{548}n}\, 1^{\frac{28}{548}n}\, 0^{\frac{6}{548}n}\, 1^{\frac{28}{548}n}.$$

It is tedious - but routine - to show that under this coloring there are $\frac{117}{2192} n^2 (1 + o(1))$
monochromatic 3-term arithmetic progressions, thereby proving the theorem. □
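
The count claimed in the proof can be sanity-checked numerically. The sketch below is an illustration and not the authors' Maple code; the names `BLOCKS`, `block_coloring`, and `mono_density` are hypothetical. It assumes $n$ is a multiple of 548 so that every block length is an integer, and it only approximates the asymptotic constant, since lower-order terms are still present at finite $n$.

```python
# Block lengths, in units of n/548, for the alternating colors 0, 1, 0, 1, ...
BLOCKS = [28, 6, 28, 37, 59, 116, 116, 59, 37, 28, 6, 28]


def block_coloring(k):
    """Coloring of [1, 548*k]: block number idx gets color idx % 2."""
    coloring = []
    for idx, length in enumerate(BLOCKS):
        coloring.extend([idx % 2] * (length * k))
    return coloring


def count_mono_3aps(coloring):
    """Count monochromatic progressions a < a+d < a+2d."""
    n = len(coloring)
    total = 0
    for d in range(1, (n - 1) // 2 + 1):
        total += sum(
            1
            for x, y, z in zip(coloring, coloring[d:], coloring[2 * d:])
            if x == y == z
        )
    return total


def mono_density(k):
    """Number of monochromatic 3-APs divided by n^2, for n = 548*k."""
    coloring = block_coloring(k)
    n = len(coloring)
    return count_mono_3aps(coloring) / n ** 2
```

Calling `mono_density(4)` (that is, $n = 2192$) gives a value close to $117/2192 \approx 0.0534$ and clearly below the random-coloring value $1/16 = 0.0625$.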

The above coloring was found using a combination of computational and analytic meth- 
ods. We briefly describe these next. 

As we have seen in the previous sections, the problem can be essentially reduced to
the minimization of the quadratic form $q(x)$ over the unit hypercube. To understand the
behavior of the solution, we solved instances of this problem for large values of $n$ ($n \approx 2000$).
For this, a "good" initial candidate coloring was found using the solution of the semidefinite
relaxation, followed by a randomization procedure known as Goemans-Williamson rounding
[GW]. The near-optimal solutions found all shared some nice structural features, essentially
being constant over large ranges of $n$, with a small number of breakpoints (equal to 12 for
most solutions).

We then used a continuous approximation to the minimization of $q(x) = x^T A x$, given by

$$\min \int\!\!\int k(x,y)\, \phi(x)\, \phi(y)\, dx\, dy,$$

where the function $\phi$ must satisfy $|\phi(\cdot)| = 1$ and the kernel $k(x,y)$ is piecewise constant.
Based on the numerical solutions for large $n$, we chose an ansatz where the function $\phi$ is
symmetric ($\phi(x) = \phi(-x)$) and piecewise constant on 12 different intervals.



Because k(x,y) is piecewise constant, the objective function is a piecewise quadratic 
function of 5 variables, namely the breakpoints (5 variables rather than 12 since we are 
assuming symmetry). It turns out that, on the partition associated with the solution obtained
by numerical computation, this function is strictly convex and its minimum lies inside the 
partition. Solving for the (local) minimum of this quadratic function, we obtained the 
breakpoints corresponding to the solution in Theorem 5. The solution presented is thus 
"locally optimal" in the sense that no small perturbation of the breakpoints will achieve 
a better value. Of course, in principle the possibility remains that there exist solutions of 
different structure that achieve even smaller values, so the argument given is not enough to 
prove global optimality. 

The Maple code that computes this quadratic function and performs the minimization is 
also available at the location cited earlier. 

5. Remarks for Further Investigation 

Clearly, the parallelogram described at the beginning of Section 3 could be further refined 
by using larger values of L. 

For the enumeration technique in Section 3.1 this would provide sharper bounds, which 
converge to the optimal constant $\beta$. However, since the number of points to be checked grows
exponentially with L, there would be an enormous increase in the computational cost (for 
example, adding two more variables would increase the computing time to approximately 51 
days). A possible improvement here could be obtained by finding an upper bound on the 
triangles for which we have used the trivial bound of their area, although this would not 
help with the exponential behavior. 

For the semidefinite bounds in Section 3.2, it is relatively straightforward (and
computationally feasible) to provide slightly better lower bounds by increasing the value of $L$.
However, even if we let $L \to +\infty$, the obtained bounds will likely not converge to the
optimal value of $\beta$, as there seems to be an "irreducible" gap between the original problem and
its corresponding semidefinite relaxation. While this issue is relatively well-understood for
finite problems, it would be of interest to fully understand the situation in this infinite limit.

Given our belief that the bound presented in Theorem 5 is sharp, perhaps the most 
promising approach would be to attempt to directly prove the (asymptotic) global optimality 
of the corresponding solution. 

Acknowledgement 

We thank Ron Graham for some background information, but more importantly for providing
the combinatorics community with interesting and challenging problems. The first author
thanks Bernd Sturmfels for forwarding the email (from Doron Zeilberger) that spurred this
collaboration.

References 



[D] B. Datskovsky, On the Number of Monochromatic Schur Triples, Advances in Applied Math. 31 (2003),
193-198.

[GW] M. X. Goemans and D. P. Williamson, Improved Approximation Algorithms for Maximum Cut and
Satisfiability Problems Using Semidefinite Programming, Journal of the ACM 42(6) (1995), 1115-1145.

[GRR] R. Graham, V. Rödl, and A. Ruciński, On Schur Properties of Random Subsets of Integers, J. Number
Theory 61 (1996), 388-408.

[RZ] A. Robertson and D. Zeilberger, A 2-Coloring of [1, N] Can Have N^2/22 + O(N) Monochromatic Schur
Triples, But Not Less!, Elect. J. of Comb. 5(1) (1998), R19.

[S] T. Schoen, On the Number of Monochromatic Schur Triples, Europ. J. of Comb. 20(8) (1999), 855-866.



Appendix 

A particular solution for the $d_i$ in Lemma 3 is given by the numbers below.